Evaluations - per - move Wins

نویسندگان

  • Hans J. Berliner
  • Chris McConnell
  • David Carmel
چکیده

24 and close to that achieved by. Berliner and McConnell 1] deal with the diiculty in nding optimistic and pessimistic bounds on the real values of game-tree positions. However, their results are not applicable for multi-model search where we look for bounds on the sum of two functions. The sum-bounds for a multi-model search reeect the maximal possible diierence between the subjective evaluations done by the player and its opponent model. In Section 5 we presented practical methods for computing such bounds. The question whether the beneet of modeling outweighs the cost of reduced pruning depends on the domain and on the evaluation function used. In this work we showed an example, in the checkers domain, where the beneet of modeling outweighs the cost of reduced pruning power. The multi-model framework allows the use of arbitrary opponent models. This work points out that pruning is signiicantly reduced for a model which is radically diierent from the player's strategy. However, for zero-sum games, we do not expect the players to evaluate boards in a radically diierent way. In such cases, the pruning power of is signiicant. A uniied theory of heuristic evaluation functions and its application to learning. Table 1: The results obtained by an iterative deepening version of when played against an iterative deepening version of. Both algorithms were allocated the same search resources { leaf evaluations per move. Each row represents a tournament of 1,000 games. The last two columns show the average search depth of the two algorithms. resources per move. We measured the search resources by the number of leaf evaluations available for a move. Table 1 shows the results obtained by , with the player P 1 = (f 1 ; (f 0 ; ?)), against , with the function f 0 , for various values of resource limit (leaf evaluations per move). Each row represents the results of a tournament of 1,000 games. We can see that in all cases searched shallower than due to its reduced pruning power. However, the deeper search of was outweighed by the beneet of modeling. When playing against an player using f 0 , , using the player P 1 = (f 1 ; (f 0 ; ?)), uses the exact opponent function as a model, in contrast to which wrongly uses ?f 0 as an opponent model. As a result, while the player prefers exchange positions, wrongly assumes …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Retrospective Evaluations of Gambling Wins: Evidence for a ‘Peak-End’ Rule

This study explored retrospective evaluations of temporally extended outcomes. Participants, primarily non-gamblers or recreational gamblers, played a fruit machine that presented sequences of payouts and then retrospectively evaluated their experiences. Experiments 1 and 2 replicated a robust “peakend” effect on estimation judgments and choices, with participants exhibiting biases in favor of ...

متن کامل

Planarity, Colorability, and Minor Games

Letm and b be positive integers and let F be a hypergraph. In an (m, b) Maker-Breaker game F two players, called Maker and Breaker, take turns selecting previously unclaimed vertices of F . Maker selects m vertices per move and Breaker selects b vertices per move. The game ends when every vertex has been claimed by one of the players. Maker wins if he claims all the vertices of some hyperedge o...

متن کامل

Martin Gardner and Wythoff ’ s Game

What’s a question to your answer? We will not settle this puzzle here, yet we’ll taste it. But let’s begin at the beginning, namely in 1907, when Willem Abraham Wythoff [7], a Dutch mathematician, invented the game we analyze here, explained vividly by Martin Gardner in [5]. The game is played with two piles of tokens, each pile containing an arbitrary number of tokens. A move consists of eithe...

متن کامل

Vertex-to-vertex pursuit in a graph

A graph G is given and two players, a cop and a robber, play the folioking game: the cop chooses a vertex, then the robber chooses a vertex, then the players move alternately beginning with the cop. A move consists of staying at one’s present vertex or moving to an adjacent vertex; each move is seen by both players. The cop wins if he manages to occupy the same vertex as the robber, and the rob...

متن کامل

Graph Ramsey games

We consider combinatorial avoidance and achievement games based on graph Ramsey theory: The players take turns in coloring still uncolored edges of a graph G, each player being assigned a distinct color, choosing one edge per move. In avoidance games, completing a monochromatic subgraph isomorphic to another graph A leads to immediate defeat or is forbidden and the first player that cannot move...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998